AIbase

# Inference Acceleration

Mera Mix 4x7B
Apache-2.0
mera-mix-4x7B is a Mixture of Experts (MoE) model that is half the size of Mixtral-8x7B, with comparable performance and faster inference.
Large Language Model Transformers
meraGPT
2,375
19
Prosparse Llama 2 7b
A large language model based on LLaMA-2-7B with activation sparsification. The ProSparse method achieves high activation sparsity (89.32%) while maintaining the original model's performance.
Large Language Model Transformers English
SparseLLM
152
15